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SUBSHIFTS WITH SLOWLY GROWING NUMBERS OF 
FOLLOWER SETS 

THOMAS FRENCH, NIC ORMES, AND RONNIE PAVLOV 


Abstract. For any subshift, define Fx{n) to be the collection of distinct 
follower sets of words of length n in V. Based on a similar result proved 
in [4], we conjecture that if there exists an n for which |Fx(’T’)I < then 
X is sofic. In this paper, we prove several results related to this conjecture, 
including verifying it for n < 3, proving that the conjecture is true for a 
large class of coded subshifts, and showing that if there exists n for which 
\Fx(n-)\ < log 2 (n + 1), then X is sofic. 


1. Introduction 

Let Y be a subshift, i.e. a closed, shift-invariant subset of where A is some 
finite set. In this paper, we consider follower sets for words w appearing in X. By 
the follower set of w we mean the set of all one-sided infinite sequences s which may 
follow w in some point of X; see Section!^ for a formal definition. It is well known 
that the number of distinct follower sets in X is finite if and only if the subshift 
X is sofic [2]. In this paper, we consider the question of whether a sufficiently 
slow growth rate in the number of distinct follower sets for words of length n in X 
implies that X is a sofic subshift. 

More specifically, let Fx (n) denote the set of distinct follower sets in X for words 
of length n. We make the following conjecture: 

Conjecture 1.1. For a subshift X, if there exists n for which |Fx(n)| < n, then 
X is sofic. 

We are unable to prove this conjecture presently, but prove some supporting 
results in this paper. Firstly, we prove that if |Fx(n)| < log 2 (n-|-1) for some u > 1, 
then X is sofic (Theorem 14.71) . We prove a version of the conjecture relating to 
\Ui<n^x{ri)\ rather than \Fx{n)\ (Theorem 14.61) . We also prove Conjecture 11.11 
for u = 1,2,3 (Theorems 14.51 H31 and 14.101) . 

Conjecture 11.11 is motivated by a number of results, among them the following 
classical theorem of Morse and Hedlund. 

Theorem 1.2 ([3]). For a subshift X, if there exists an n such that the number of 
words of length n is less than or equal to n, then X is a finite collection of periodic 
points. 

An equivalent formulation is that a bound of n on the number of words of length 
n implies a uniform bound on the number of words of length n. With this phrasing, 
Conjecture 11.11 is equivalent to a version of Theorem 11.21 with “words of length n” 
replaced by “follower sets of words of length n”. 
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Another motivation is recent work of the three authors. In [I], building on 
an example of Delacourt, the first author showed that even when bounded, the 
sequence {|A'a:(i^)|} can exhibit some surprising behavior. In particular, he showed 
that while the sequence {|i^x(i^)|} is always eventually periodic, it is not necessarily 
eventually constant. In fact, the gaps between consecutive terms in the periodic 
portion can be arbitrarily prescribed. Even more closely related, the second and 
third authors proved in [4] that Conjecture II.II holds if follower sets are replaced 
by so-called extender sets. For a word w appearing in X, the extender set of w is 
the set of all pairs (p, s) of a left-infinite sequence p and a right-infinite sequence s 
such that the concatenation pws forms a legal point in X. Let Ex(n) denote the 
set of distinct extender sets for words of length n in X. 

Theorem 1.3 ([!]). Let X be a subshift. The following are equivalent. 

(1) X is Sofia 

(2) the sequenee {\Ex(n)W is uniformly bounded 

(3) for some n> 1, Ex(n) < n. 

It is then natural to ask whether the above holds for follower sets as well, though 
the question seems more difficult. 

We remark that one obvious approach would be to attempt to use Theorem 11.31 
above. In other words, one might attempt to prove that a small number of follower 
sets implies a small number of extender sets, and therefore soficity. Indeed, this still 
may be an avenue to a proof. However, in Example 13.11 we show that the sequence 
{|£'x(?r)|} may grow exponentially while {|Ex(n)|} grows linearly, meaning that 
this approach may not be enough on its own. 

2. Definitions and preliminaries 

We begin with a list of definitions. Let A denote a finite set, which we will refer 
to as our alphabet, elements of A will be referred to as letters. 

Definition 2.1. A subshift X on an alphabet A is some subset of A^ which is 
shift-invariant and closed in the product topology. 

Definition 2.2. A word over A is a member of A" for some n € N, which we call 
the length of w. We use 0 to denote the empty word, the word of length zero. 

Definition 2.3. For any words v G A" and w G A™, we define the concatenation 
vw to be the word in A"’*'’” whose first n letters are the letters forming v and whose 
next m letters are the letters forming w. 

Definition 2.4. For a word u G A", if u can be written as the concatenation of 
two words u = vw then we say that u is a prefix of u and that lu is a suffix of u. 

Definition 2.5. The language of a subshift X, denoted by L{X), is the set of all 
words which appear in points of X. For any finite n G N, define Ln{X) = L(Ai)nA", 
the set of words in the language of X with length n. 

Definition 2.6. For any subshift X on an alphabet A, and any word w in the 
language of X, we define the follower set of w in X, Fx{w), to be the set of all 
right-infinite sequences s G A^ such that the infinite word ws occurs in some point 
of X. (Note that Ex{0) is simply the set of all right-infinite sequences appearing 
in any point of A"). Similarly, we define the predecessor set of re in X, written 
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Px{w), to be the set of all left-infinite sequences p € A ^ such that pw occurs in 
some point of X. 

Definition 2.7. For any word w G L{X), we say that w is shortenable if there 
exists V € L(X) with strictly shorter length than w such that Fx(w) = Fx{v). 

Definition 2.8. For any subshift X over the alphabet A, and any word w in the 
language of X, we define the extender set of w in X, Ex{w), to be the set of all 
pairs (p, s) where p is a left-infinite sequence of symbols in A, s is a right-infinite 
sequence of symbols in A, and pws is a point of X. 

Definition 2.9. For any positive integer n, define the set Fx{n) = {Fx{w) \ w G 
Ln{X)}. Thus the cardinality \Fx{n)\ is the number of distinct follower sets of 
words of length n in X. Similarly, define Ex{n) = {Ex{w) \ w G L„(X)} and 
Px{n) = {Px{w) I w G Ln{X)}, so that \Px{n)\ and \Ex{n)\ are the numbers of 
distinct extender sets of words of length n in X and predecessor sets of words of 
length n in X respectively. 

Definition 2.10. A subshift X is sofic if it is the image of a shift of finite type 
under a continuous shift-commuting map. 

Equivalently, sofic shifts are those with only finitely many follower sets, that 
is, a shift X is sofic iff {Fjf(r(;) | u; in the language of A} is finite (See Theo¬ 
rem 3.2.10 of m)- The same equivalence exists for extender sets: X is sofic iff 
{Exiw) I w in the language of X} is finite. (See Lemma 3.4 of [3]) 

3. An example with many more predecessor sets than follower sets 

Example 3.1. There exists a subshift X such that for every n, |Fx(n)| = 2n -|- 1 
and for every n > 6, \Px{n)\ > 2L”/'^J. 

Proof. Define a labeled graph G as follows: the vertex set is E = = {0,1, 2,...}. 

From any vertex n are two outgoing edges: one leads to n -|- 1 and is labeled with U 
(for ‘up’), and the other leads to [n/2j and is labeled with D (for ‘down’), unless it 
is the lone self-loop in the graph from 0 to itself, in which case it is labeled with E 
(for ‘equals.’) Then, define a subshift X with alphabet {D,U,E} whose language 
consists of all labels of finite paths on G. For example, since from 17 one could 
follow D (to 8), D (to 4), U (to 5), U (to 6), D (to 3), D (to 1), D (to 0), and then 
E (to 0), DDUUDDDE would be a word in the language of X. An example of 
a word not in the language of X would be EUUDDD, since E must terminate at 
0, and then following U would take you to 1, another U to 2, D to 1, D to 0, and 
another D is not legal from 0. Note that G is right-resolving, i.e. given an initial 
vertex n and label a, there is at most one edge with initial vertex n labeled by a. 

We will need two auxiliary notations: for any w G L{X), denote by Tc{w) the 
set of terminal states of paths in G labeled by w, and by Ig{w) the set of initial 
states of paths in G labeled by w. Similarly, for any n G V, denote by Fain) the 
set of labels of right-infinite paths in G with initial state n, and by Pain) the set 
of labels of left-infinite paths in G with terminal state n. It should be clear that 
for any w, Fx{w) = ^g(r), and Px{w) = U„e/G(^) Pain). 

Claim 1: For every n, \Fx{n)\ = 2n + 1. 

Fix any n, and consider any w G Ln{X) which contains an E. Since E can only 
terminate at 0, and G is right-resolving, there is only one possible terminal vertex 
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of a path in G labeled by w, and so Tg{w) = {fc} for some fc, and correspondingly, 
Fx{w) = Faik). Since the largest vertex in V that can be reached from 0 via a path 
of length less than nisn — 1, 0<fc<n — 1. We claim that for fc < fc' G [0, n — 1], 
Fcik) 7 ^ Fcik'); if 2 ™ is the smallest power of 2 greater than k, then the reader 
may check that ~^~^D™'EEEE... is in E^k) but not Faik'). Finally, we 
note that for each k G [0,n — 1], the word w = has Tg{w) = {fc}, and so 

all n of the distinct follower sets Fb(0), Fg(1), ..., Fain — 1) are in Fx{n). 

Now, consider any w € Ln{X) which does not contain an E. We will prove by 
induction on n that Tg{w) = [fc,oo) for some 0 < k < n. The hypothesis is easy 
for n = 1; Tg{U) is clearly [1,(X)), and Tg{D) is similarly clearly [0,oo). Now, 
suppose that the inductive hypothesis is true for n, and consider w G Ln+i{X). We 
can of course represent w = w'a, where w' G Ln{X). By the inductive hypothesis, 
Tg{w') = [fc, oo) for 0 < fc < n. The reader can verify that if a = D, then 
Tg{w) = [[fc/2j,oo), and ii a = U, then Tg{w) = [k + l,oo), completing the 
inductive step. The proof is then completed, and so Eg{w) = lJi>fc ^g(*) for some 
k G [0,n]. 

We claim that for k < k' G [0,n], lJi>fc FG{i) ^ Ui>fc' again, if 2™ is the 

smallest power of 2 greater than fc, then U^"'~’^~^D"^EEE ... is in EG{k) but not 
FG{i) for any i > k'. We note that for each k G [0,n], the word w = has 

Tg(w) = [fc, oo), and so follower sets lJi>fc FgIi) are in Ex(n) for fc = 0,1, 2 ,..., n. 

There are therefore n distinct follower sets of words w G Ln{X) containing an 
E and n + 1 distinct follower sets of words w G Ln{X) not containing an E. To 
prove the claim that there are 2n + 1 in total, we verify that for all k, k' G [0,n], 
Ui>fc' FG{i) ^ FG{k). It k ^ k', we have already distinguished these sets by iden¬ 
tifying an element of FG(min(fc, k')) that is not in FgIi) for any i > min(A:, k'). If 
k' = k and 2™ > fc, then D"^~^^EEE... is in Fg( 2’") C lJi>fc'^G (*)5 but not in 
FG{k). This completes the proof of the claim. 

Claim 2: For every n > 6 , \Px{n)\ > 2 L”/‘*J. 

We will consider the set S of all w G {D,U, E}"^ which end with E, 

contain no other E, and do not contain consecutive U symbols. Clearly jS”! is the 
number of ([n/2j)-letter words on {D,U} without consecutive U symbols, which 
is greater than or equal to 2L"/4J (simply freely choose the first letter, force the 
second to be D, freely choose the third letter, and so on). Therefore, it suffices to 
show that S C and that for w ^ w' G S, Px{w) ^ Px{w'). We will verify 

both claims by the auxiliary claim that for any w G S, Ig{w) is a nonempty finite 
interval of integers, and that itw^w'GS, then Ig{w) 7 ^ Ig{w'). Clearly the fact 
that Ig{w) 7 ^ 0 will imply that w G Ln{X). Also, if Ig{w) ^ Ig{w'), then one 
can choose k G /g(ic)A/g('ic')- The reader may check that .. .EEEU’^ G PG{k) 
and is not in PG{k') for any k' 7 ^ k, and so Px{w) = Uig/gI™) Tg(*) and Px{w') = 
Uieiciw) Pcii) are distinct. 

It remains only to prove the auxiliary claim. Consider any w G S. We will work 
backwards from the end of w to determine Ig{w). First, write w = 

The reader may check that E) = [2i"Al-2^ which we write 

as [a, h) for brevity. Write v = v\n/ 2 \ ■ ■ ■ V 2 V 1 . Then, we will work from the right 
and state how each Vi will alter this interval. For instance, if vi = D, then 

E) is the set of all vertices which lead to a vertex in [a,b) via an 

edge labeled D, or [2a, 2b). Similarly, if vi = U, then E) is the set of 
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all vertices which lead to a vertex in [a, b) via an edge labeled U, or [a — 1, 6— 1). In 
fact, it is simple to see in the same way that each Vi will either double the endpoints 
of the interval (if Vi = D) or subtract one from the endpoints of the interval (if 
Vi = U). Since a = > [ n/2j (for n > 6), clearly neither endpoint will ever 

go below 0 in this procedure. 

This allows us to give a closed form for Ig{w). Since U does not change the 
length of the interval and D doubles it, clearly the length of Ig{w) is ‘2^{b — a), 
where j is the number of D symbols in v. The left endpoint of Ig{w), call it c, is 
obtained from a via a sequence of either doubling or subtracting 1, determined by 
whether the letters vi, V 2 , etc. are D or U respectively. Since no two consecutive 
Vi can be U, two subtractions in a row are not permitted. For instance, if a = 16 
and V = UDDDUD, then c = (((((16 • 2) — 1) • 2) • 2) • 2) — 1 = 247. It is not hard to 
check that this final answer could also be written as c = 2^a— (2"^ + 2"^ +... + 2"'“), 
where j is again the number of D symbols in v, k is the number of U symbols in 
V, and Ui is the number of D symbols preceding the ith U symbol in v. (Note that 
since v does not contain consecutive U symbols, {ui} is strictly decreasing). For 
instance, for v = UDDDUD, there are three D symbols preceding the rightmost U 
and zero D symbols preceding the leftmost U, and so ni = 3 and n 2 = 0, yielding 
c = 2‘^a — (2"i + 2"^) = 2'^ • 16 — 8 — 1 = 247. This yields the closed form 


Ig{w) = 


2^a-^2”%2^6- 


k 


Then, if w and w' have different numbers of D symbols, then the lengths of Ig{w) 
and Ig{w') are different, clearly implying that Ig{w) ^ Ig{w')- If w and w' have 
the same number of D symbols, then the choices of rn for w and w' are distinct 
(since the rii uniquely determine v), and by uniqueness of binary representation, 
the sum 2”* would take different values for w and w', again implying that 

Ig{w) yf Ig{w'). Therefore, all words in S have distinct predecessor sets in X, and 
we are done. 

□ 


We note that since predecessor sets are just projections of extender sets, this 
example clearly has \Ex{n)\ > 2L"/'^J for every n > 6 as well, illustrating that in 
general, the number of extender sets of words of length n may be much greater 
than the number of follower sets of words of length n. 


4. Main Results 

We begin with some simple facts about follower sets, which will repeatedly be 
useful in our analysis. The proofs are simple and left to the reader. 

Lemma 4.1. For any subshift X, any w G Ln{X), and any m € N, Fx{w) = 
[Jv ^xivw), where the union is taken over those v G Lm(,X) for which vw G 
Uni+ni^) ■ 

Lemma 4.2. For any subshift X, any w G Ln(X), and any m < n, there exists a 
V G Ljn{X) for which F{v) U F{w). 

Lemma 4.3. Let X be a subshift. If for two words w,u G L{X), Fx{w) = Fx{u), 
then for any v G L(X), Fx(wv) = Fx(uv). 
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The following will be our main tool for proving soficity of a subshift via the sets 
Fx{n). 

Theorem 4.4. For any subshift X, if there exists n G N such that Fx{n) C 
[J Fx{£), then X is sofic. 

l<n—l 

Proof. If there exists n G N such that Fxiji) C Fx{(), then for any word 

i<n—l 

w G Ln{X), the follower set Fx{w) is also the follower set of a strictly shorter 
word, so w is shortenable to a word of length strictly less than n. Now, let v G 
L{X) of length greater than n, say v = viV 2 ...VnVn+i---Vk where k > n. Then 
viV 2 ...Vn G Ln{X), and so is shortenable to some word v' G L{X) of length less than 
n. But FxiviV 2 ...Vn) = Fx{v') implies Fx{viV 2 -VnVn+l-'Vk) = Fx{v'Vn+l-Vk) 
by Lemma 14.31 so v is shortenable to a word v'vn+i.-.Vk- If v'vn+i-.-Vk has length 
less than n, stop, but if v'vn+i-.-Vk has length at least n, we may apply the above 
process again and shorten repeatedly, getting shorter and shorter words with the 
same follower set until we find one with length less than n. So n is shortenable to a 
word of length less than n. But this means that Uf<rt-i Fx{f) contains all follower 
sets in X, so X has only finitely many follower sets, and thus, X is sofic. □ 

We can now show that \Fx{n) \ = 1 for any n always implies soficity of X. 

Theorem 4.5. For any subshift X, if there exists n for which \Fx(n)\ = 1, then 
X is a full shift. 

Proof. We prove the contrapositive. Without loss of generality, assume that the 
alphabet A of X consists entirely of letters which actually appear in points of X, 
and assume that X is not the full shift on A. Then there exists a word w = 
W 1 W 2 . ■ .Wk & A^ which is not in the language of X\ suppose that the length k of w 
is minimal. It must be the case that k is at least 2, since we assumed that all letters 
of A are in L{X). Then since we assumed k to be minimal, W 2 . ■ .Wk G L{X), so 
we can choose some one-sided infinite sequence s appearing in X which begins with 
W 2 ■. - Wk. Similarly, wi is in L{X), so for any n G N, we may choose an n-letter 
word V ending with wi. Then vs contains w ^ L{X), so s ^ Fx(v). However, since 
s appears in X, there exists some n-letter word u which can be followed by s in 
X, and so s G Fx{u). Hence Fx{u) ^ Fx{v), so \Fx{n)\ > 2, and since n was 
arbitrary, this is true for all n. □ 

We can now prove a version of Coniecture ll.ll for unions of the sets Fx{n), rather 
than the sets themselves. 


Theorem 4.6. For any subshift X, if there exists n G N so that 
then X is sofic. 


U 

£<n 


< n, 


Proof. We prove the contrapositive, and so assume that X is nonsofic. By The¬ 
orem |431 |Fjs:(l)| > 2. Then, by Theorem 14.41 for every n > 1, there exists 


S G Fxin) \ and so 


> 


. Therefore, by 

induction, for each n. 


>n + l. 



□ 
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We may now prove the following, which establishes a logarithmic lower bound 
for the growth rate of |Fx(n)| for nonsofic shifts. 

Theorem 4.7. For any subshift X, if there exists n G N such that |Fx(n)| < 
log 2 (n + 1), then X is sofic. 


Proof. Suppose that for some n, Fx{n) = {Fi, F 2 , Fk} where k < log 2 (n+l). By 
Lemma im for each length i < n, every follower set of a word in Le(X) is a union of 
follower sets of words of length n. Therefore, every element of IJ^<n (•^) ^ 

empty union of elements of Fx{n). There are at most 2^ — 1 < 2 ^° 82 ("+i) _ 1 = 77 , 
such unions, so < n. which implies that X is sofic bv Theorem l4.61 □ 


Our next result shows that under the additional assumption that some non¬ 
empty word w has the same follower set as the empty word. Conjecture 11.11 is 
true. 


Lemma 4.8. For any subshift X, if there exists a non-empty word w G L{X) such 
that Fx(w) = Fx(0) and n G N such that |Fx(n)| < n, then X is sofic. 


Proof. The follower set of the empty word is the set of all right-infinite sequences 
appearing in any point of X. If there exists a word w such that any legal right- 
infinite sequence may appear after w, then by Lemma 14.21 there is a letter with 
this property as well. So we may assume that Fx{a) = Fx( 0 ) where a is a single 
letter. 


The fact that Fx(0) = Fx{a) implies by Lemma Td.31 that for every w G L{X), 
Fx{w) = Fx(aw) = Fx{aaw) = .... Therefore, every follower set of a word of 
length ^ is also a follower set of a word of any length greater than (.. In other 
words, Fx(l) C Fx(2) C Fx(3) C .... Then, for every n, Fx{n) = Uf<n^Y(^), 


and so if |Fx(n)| < n for some n, clearly 
sofic by Theorem 14.61 




< n, implying that X is 
□ 


Theorem 4.9. For any subshift X, if there exists n > 2 for which |Fx(7i)| < 2, 
then X is sofic. 

Proof. The case where |Fx(n)| = 1 is treated by Theorem 14.51 so we choose any 
n > 2 and suppose that there are exactly 2 follower sets in X of words of length 
n, say Fi and Fj. We consider the sets in Fx(l). By Lemma [4.11 every element 
of Fx(l) is either Fi, F 2 , or Fi U F 2 . If |Fx(l)| = 1, X is sofic by Theorem 14.51 
so assume that |Fx(l)| > 2 , that is, at least two of the above sets must appear 
in Fx(l). Note that F( 0 ) = \JweLxin) — -^1 U F 2 , so by Lemma 1^751 if 

Fi U F 2 is an element of Fx(l), then X is sofic. The only remaining case is that 
Fx(l) = {Fi,F 2 } = Fx(n), and then X is sofic by Theorem 14.41 □ 

We are now prepared to prove Coniecture 11.11 for n < 3. Our proof is much more 
complicated than the cases where n = 1 , 2 . 


Theorem 4.10. Let X be a subshift. If |Fx(n)| < n for any n < 3, then X is 
sofic. 

Proof. Clearly, for n < 3, Theorems 14.51 and 14.91 imply this result. We can then 
restrict to the case where n = 3. If |Fx(3)| < 3, then X is again sofic by either 
Theorem 14.51 or Theorem l4.9l We therefore suppose that |Fx(3)| = 3, say Fx(3) = 
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{Fi, ^2,^3}. We also note that F(0) = Fi UF2UF3, and if any of Fx{l), Fx(2), or 
Fv(3) contains F1UF2UF3 as an element, then X is sofic by Lemma [4^ Therefore, 
in everything that follows, we assume that Fi U Fj U F3 is not contained in Fx{i) 
for i < 3. 

We first show that if any F^ is contained entirely within another, then X is sofic. 
Suppose for a contradiction that some F^ is contained in another, and so without 
loss of generality, we say that F 2 Q Fi. By Lemma Td. 11 all elements of Fjf(2) are 
nonempty unions of Fi,F2, and F3. However, Fi U F3 = Fi U F2 U F3, and so 
Fi U F3 = Fi U F2 U F3 are not in Fx( 2 ) as assumed above. Also, Fi U F2 = Fi. 
Therefore, the only possible elements of Fjf ( 2 ) are Fi, F2, F3, and F2 UF3. If fewer 
than three of these four sets are part of Fx( 2 ), then X is sohc by Theorem 14.91 
Thus we may assume at least three of the four sets appear. If Fi, F2, and F3 are all 
in Fa ( 2 ), then Fx{3) C Fx( 2 ), implying that X is sofic by Theorem l4.4l Therefore, 
F2UF3 G Fx{‘2). We note that by Lemmasome element of Fjc( 2 ) must contain 
Fi. If F3 contained Fi, then F3 = F1UF2UF3 is in Fjc ( 3 ), which we assumed not to 
be the case above. Similarly, F2 U F3 cannot contain Fi. Therefore, Fi is the only 
set of Fi, F2, F3, and F2 U F3 to contain Fi, and so Fi G Fx(2). Therefore Fx( 2 ) 
consists of Fi, F2 U F3, and exactly one of F2 and F3. We note that if F2 U F3 is 
equal to any of Fi, F2, or F3, then either |Fjf( 2 )| = 2 or Fx( 3 ) C Fx( 2 ), in either 
case implying sohcity by either Theorem 14.91 or Theorem 14.41 So from now on we 
assume F2 U F3 is not equal to Fi, F2, or F3. 

Now, let us consider Fjf(l). By Lemma l4Tl Fjf(l) can only consist of unions 
of sets in Fx( 2 ). The set Fjf ( 1 ) cannot contain Fi U F3 = Fi U F2 U F3, and since 
Fi U F2 = Fi we see that Fjf(l) C Fjf( 2 ). There exists some word ab G Fx( 2 ) 
such that Fx{ab) = F2 U F3. Clearly Fx{a) is an element of Fx{l) and therefore 
Fx{a) = Fx{xy) for some xy G L 2 {X). But then by Lemma 1431 Fx{xyb) = 
Fx{ab) = F2 U F3, a contradiction since we above noted that F2 U F3 does not 
equal any of Fi, F2, or F3. We have then shown that if any of the follower sets 
Fi,F2, and F3 are contained in one another, X is sofic, and so for the rest of the 
proof assume that no such containments exist. Note that this also implies that if 
any of Fi U F2, Fi U F3, or F2 U F3 contain each other, then the containing set is 
Fi U F2 U F3, which we have assumed is not in Fx(l), Fx( 2 ), or Fx( 3 ). 

We break the remainder of the proof into cases by how many of the sets Fi, F2, 
and F3 are elements of Fx( 2 ). If all three of the sets are elements of Fx{2), then 
X is sohc by Theorem 14.41 We then have three remaining cases. 

Case 1: none of Fi, F^, F3 are in Fx( 2 ). By Lemma [4Tl Fx{2) consists of 
nonempty unions of Fi , F2 , and F3 , and we have assumed that Fi U F2 U F3 is not 
in Fx{2). If |Fx( 2 )| < 2 , then X is sohc by Theorem 14.91 The only possibility is 
then that Fjf( 2 ) = {Fi U F2,Fi U F^,F2 U F3}. Then by Lemma Fx(l) must 
contain supersets of each of these sets, and it cannot contain Fi U F2 U F3. This 
forces Fx(l) to also be {Fi U F2, Fi U F3, F2 U F3}, meaning that Fx( 2 ) = Fx(l), 
and so X is sohc by Theorem 14.41 

Case 2: exactly one of Fi, F2, F3 is in Fx( 2 ). Without loss of generality, 
suppose that Fi G Fx( 2 ) and F2,F3 ^ Fx( 2 ). At least two other sets must be 
elements of Fx( 2 ) or else X is sohc by Theorem 14.91 and they must be unions of 
Fi, F2, and F3 by Lemma [ 4.11 Therefore, Fjf( 2 ) contains at least two of the sets 
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Fi U F 2 , Fi U F 3 , and J ^2 U F 3 . By Lemma 1121 supersets of any such unions are 
also present in Fx(l), which must be the sets themselves since we’ve assumed that 
Fi UF 2 UF 3 ^ Fx(l). If Fi is also in Fjf (1), Fx{2) C Fjf(l), and X would be sofic 
by Theorem 14.41 so Fi ^ Fjf(l). 

Now, let abc be some word such that Fx{abc) = F 2 . What, then, is the fol¬ 
lower set of ab? If it is any set in Fjf(l), then there would exist d so that 
Fx{ab) = Fx{d), and then Fx{abc) would equal Fx{dc) by Lemma [4.31 mean¬ 
ing that F 2 S Fjf(2), a contradiction. So the only choice for Fx{ab) is Fi. Since 
at least two of Fi U F 2 , Fi U F 3 , and F 2 U F 3 are in Fx{2), Fx(2) contains a set of 
the form Fi U F^. Say that Fx[xy) = Fi U F^. Then, Fx{xy) 3 Fx{ab), meaning 
that Fx{xyc) 3 Fx{abc) = F 2 . Since none of the Fi contain each other, this means 
that Fx{xyc) = F 2 . But then since Fi U Fi also is a member of Fx(l), there exists 
2 ; so that Fx{z) = Fi U Fi, and then by Lemma 1121 Fx{zc) = F 2 , a contradiction 
since F 2 ^ Fx(2). Hence, X is sohc in this case as well. 

Case 3: exactly two of Fi, F 2 , F 3 are in Fx{2). Without loss of generality, 
suppose that Fi,F 2 G Fx(2) and F 3 ^ Fjf(2). By Lemma [T2l Fx(2) must contain 
some superset of F 3 which is not Fi UF 2 UF 3 , so it is of the form F 3 UFi for i = 1 or 
2. As in Case 2, any of the sets Fi U F 2 , Fi U F 3 , or F 2 U F 3 which is an element of 
Fx(2) must be in Fx(l) as well. This means that if Fi and F 2 are both in Fx{l), 
then Fx(2) C Fj 5 s:(l) and X would be sohc by Theorem 14.41 so we restrict to the 
case where at least one of these sets is not in Fx{l)- 

Now, let abc be some word such that Fx{abc) = F 3 . As in Case 2, the follower set 
of ab must be some set which occurs in Fx(2) but not Fx{l), which must be either 
Fi or F 2 (depending on which is not part of Fjs:(l)). Without loss of generality, we 
say that Fx{ab) = F 2 . We now show that neither Fi U F 2 nor F 2 U F 3 is in Fx{2). 
Suppose for a contradiction that there is a word xy G L{X) for which Fx{xy) = 
F 2 U Fi, i = 1 or 3. Then, since F{xy) A F{ab) = F 2 , F{xyc) A F{abc) = F 3 . 
Again, since no Fi contains another, this implies that F{xyc) = F 3 . Finally, we 
note that F{y) A F{xy) = F 2 U F^, so F{y) = F 2 U F^. Therefore, by Lemma 021 
F{yc) = F{xyc) = F 3 , but this is a contradiction since F 3 ^ Fx(2). We now know 
that neither Fi U F 2 nor F 2 U F 3 is in Fx(2). By Lemma [4.11 all sets in Fx(2) 
are nonempty unions of Fi, F 2 , and F 3 , and if |Fx(2)| < 3, then X is sofic by 
Theorem 14.91 The only remaining case is then that Fx{2) = {Fi, F 2 , Fi U F 3 }. 

We now consider the sets in Fjc(1). Recall that F 2 ^ Fjc(1) and that Fi U F 3 G 
Fx(l) since Fi UF 3 G Fx(2). If |Fx(l)| = 1, then X is sofic by Theorem l4.5l so we 
can assume that Fjf(l) contains at least one other set, which must be a nonempty 
union of the elements of Fx(2) by Lemma [4.11 The only possibilities are Fi and 
Fi U F 3 , since we assumed earlier that Fi U F 2 U F 3 ^ Fx(2). Therefore, every set 
in Fx(l) is a superset of Fi. 

Our final step will involve considering what happens when a word with follower 
set Fi is extended on the right by a letter. Suppose for a contradiction that there 
exists a word w G L{X) with Fx{w) = Fi and a letter i for which Fx{wi) = F 2 . 
Then, for any letter j, since Fx{j) G Fx{l), Fx{j) A Fx{w) = Fi. Therefore, 
Fx{ji) A Fx{wi) = F 2 . However, the only superset of F 2 in Fx(2) is F 2 itself, 
and so for every j G A, Fx{ji) = F 2 . Finally, note that, by Lemma 0Tl Fxii) = 
Uj Fx{ji) = F 2 , a contradiction since F 2 ^ Fx(l). 
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Similarly, let’s assume for a contradiction that there exists a word w G L{X) 
with Fx{w) = Fi and a letter i for which Fx{wi) = F 3 . Then, choose a letter j 
with Fx{j) = FiU F 3 . Then, since Fx{j) 2 Fx{w) = Fi, Fx{ji) 2 Fx{wi) = F 3 . 
However, the only superset of F 3 in Fx{2) is Fi U F 3 , so Fx{ji) = Fi U F 3 . Then, 
since Fx{j) = Fx{ji) = Fi U F 3 , by Lemma[^ Fxijn) = Fx{ji) = Fi U F 3 , a 
contradiction since Fi U F 3 ^ Fx(3). 

This means that for every word w G L{X) with Fx{w) = Fi and any letter a 
for which wa G L{X), F{wa) = Fi. But then, since the follower set of every letter 
contains Fi, the follower set of every legal 2 -letter word contains Fi, a contradiction 
since Fx{ 2 ) contains F 2 , and we assumed that none of the Fi contains another. 
Every case has either led to a contradiction or to the conclusion that X is sofic, 
and so we’ve proved that X is sofic. 

□ 

Our final result is a version of Conjecture 11.11 for a class of coded subshifts. 
Recall the definition of coded subshifts below. 

Definition 4.11. Given a set W of finite words, the coded subshift with code 
words yV is the subshift generated by taking the closure of the set of all biinfinite 
sequences made from concatenating words in W. 

Theorem 4.12. Given a sofic shift X, choose a subset W C L{X) with the property 
that for any finite word v G L{X), there exists some w GW such that v is a suffix 
of w. Create a coded subshift Y with code words {wc \ w G W} where c is a letter 
not appearing in the alphabet of X. Then if |Fy(n)| < n for any n G N, then Y is 
sofic. 

Proof. We begin with two preliminary observations. Firstly, X C Y, since any 
point of X is a limit of finite words in L(X), all of which are suffixes of code words, 
which are themselves in L{Y). We also note that any word in L{Y) without a c 
must be a subword of a code word, and therefore in L{X). 

Secondly, for any word ucv G L{Y), Fy(ucv) = Fyicv). Clearly Fy{ucv) C 
Fy[cv). Let s G Fy{cv). Because uc is the suffix of a concatentation of code words 
and vs is the beginning of a concatenation of code words in Y, ucvs occurs in Y 
and therefore Fyipicv) 2 Fy{cv). 

We begin our proof by claiming that there are only finitely many follower sets 
in Y of words not containing the letter c. Given any word w G L{Y), if w does 
not contain a c then w G L{X). There are only finitely many follower sets in 
X, so it is sufficient to show that for any w,v G L{X), Fx{w) = Fx{v) implies 
Fy{w) = Fy{v). To that end, let Fx{w) = Fx{v) and consider any s G Fy{w). 
If s does not contain the letter c, then ws is a limit of longer and longer words in 
W, and since all such words are in L{X), ws occurs in X, i.e. s G Fx{w). Since 
Fx{w) = Fx{v), s G Fx{v), i.e. vs also occurs in X. Since Y 2 X, vs occurs in Y 
as well, and so s G Fy{v). 

On the other hand, if s contains the letter c and s G Fy(w), then s = s'cs" for 
some s' not containing c (s' may be the empty word). By the same logic as above, 
ws' G T(X), therefore vs' G T(X), and so vs' occurs as a suffix of some word in 
W. But then, vs'c is a suffix of some code word, and so vs'cs" occurs in Y. 

We have shown that in both cases, s G Fy{w) implies s G Fy{v), and so Fy{w) C 
Fy{v). By the same argument, Fy{v) C Fy{w), giving Fy(w) = Fy{v). Therefore 
there are only finitely many follower sets in Y of words not containing c. 
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Now, we assume that n is such that |JV(r)| < n. Partition Ln(Y) into n + 1 sets 
based on the last appearance of the letter c in the word-the first set consists of 
words with no c, the second set consists of words ending with c, the third S 2 
consists of words ending with c followed by another letter that is not c, and so on, 
up to the final set Sn which consists of words beginning with a c followed by n — 1 
other symbols which are not c. Since X QY, there exist words in L{Y) of every 
length without any c symbols, implying that S'o 7 ^ 0. Therefore, there must exist 
fc > 0 so that all follower sets (in F) of words in Sk are also follower sets (in F) of 
some word in Si for some i < k; else each of the n + 1 sets Si would contribute a 
follower set not in any previous one, contradicting |_FV(n)| < n. 

Let w be a word in L{Y) of length at least k. Our goal is to show that Fy{w) is 
either equal to one of the finitely many follower sets of words without a c or to the 
follower set of a word of length less than k. Clearly, if w does not contain a c, we 
are done, so suppose w contains the letter c. As noted earlier, Fy{w) is unchanged 
if all letters before the last occurrence of c are removed from w. If this removal 
results in a word of length less than k, then again we are done. So let us proceed 
under the assumption that w begins with c, has length k or greater and contains 
no other c symbols. 

Let p denote the fc-letter prefix of w. Since p begins with c, p can be arbitrarily 
extended backwards in any way to yield an n-letter word p' which has the same 
follower set as p. Note that p' £ Sk, and so there exists i < k and p" £ Si so 
that Fy{p) = Fy{p') = Fy{p"). There are two cases. If j 7 ^ 0, then we may again 
remove the letters of p" before the final c symbol to yield a word p'" of length i < k 
for which Fy{p) = Fy{p"'). Then, we replace the prefix p of w by p'" to yield a 
new word w' with strictly smaller length, which still begins with a c and contains 
no other c symbols, and for which Fy{w) = Fy{w') by Lemma lT3l We then repeat 
the above steps. If at each step, i 7 ^ 0, then eventually w will be shortened to a 
word of length at most k with the same follower set in F, of which there are clearly 
only finitely many. 

The only other case is that at some point, the prefix of length k has the same 
follower set in F as a word in Sq . Then, again by Lemma 14.31 that prefix can 
be replaced by the word in Sq, yielding a word with no c symbols with the same 
follower set in F as w. There are only finitely many follower sets in F of words 
not containing c. We have then shown that Fy{w) (for arbitrary w of length at 
least k) has follower set in F from a finite collection (namely all follower sets in F 
of words with no c and all follower sets in F of words with length at most fc — I), 
which implies that F is sofic. 

□ 

Remark 4.13. The class of coded subshifts F which may be created as in Theo¬ 
rem 0311 includes all so-called S-gap shifts [5] (with X = {0°°} and c = I) and the 
reverse context-free shift of [5] (with X = {a, 6 }^ and c as in the theorem). 
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